Strong Associations Can Be Weak: Some Thoughts on Cross-lingual Word Webs for Translation
نویسنده
چکیده
This paper discusses the implications of human word association norms on the modelling of word associations from large corpora and the relevance of different types of associations in the process of translation, with a focus on adjectives. It is observed that the proportion of paradigmatic responses found in English norms tends to be higher, whereas a clear preference for syntagmatic associations is exhibited in Chinese norms. Further comparison with corpus-based extracted associations, using various functions in the Sketch Engine, shows that collocational associations might be more effectively extracted, but there is also considerable individual variation for different words. It is suggested that although free associations elicited in isolated context serve to reveal a wide range of potential lexical relations, their usefulness and relevance in real language applications should consider the actual task and its information demand. A purposebased approach to construct cross-lingual word webs for computer-aided translation is thus proposed.
منابع مشابه
Cross-lingual Lexical Similarity in Translation 1 Running head: Cross-lingual Lexical Similarity in Translation Semantic Access in Number Word Translation: the Role of Cross-Lingual Lexical Similarity
The revised hierarchical model of bilingualism (e.g., Kroll & Stewart, 1994) assumes that second language (L2) words primarily access semantics through their first language (L1) translation equivalents. Consequently, backward translation from L2 to L1 should not imply semantic access but occurs through lexical wordform associations. However, recent research with Dutch-French bilinguals showed t...
متن کاملCross-lingual numerical distance priming with second-language number words in native- to third-language number word translation.
An important question in research about bilingualism is whether translation from a second language (L2) to the native language (L1) is semantically mediated or whether it occurs through word associations at the lexical level. Recent research has shown that both L1 and L2 number word translation imply semantic access, suggesting strong L2 lexicosemantic mappings for number words (Duyck & Brysbae...
متن کاملEnglish-Persian Plagiarism Detection based on a Semantic Approach
Plagiarism which is defined as “the wrongful appropriation of other writers’ or authors’ works and ideas without citing or informing them” poses a major challenge to knowledge spread publication. Plagiarism has been placed in four categories of direct, paraphrasing (rewriting), translation, and combinatory. This paper addresses translational plagiarism which is sometimes referred to as cross-li...
متن کاملSparse Bilingual Word Representations for Cross-lingual Lexical Entailment
We introduce the task of cross-lingual lexical entailment, which aims to detect whether the meaning of a word in one language can be inferred from the meaning of a word in another language. We construct a gold standard for this task, and propose an unsupervised solution based on distributional word representations. As commonly done in the monolingual setting, we assume a word e entails a word f...
متن کاملMeasuring the Adequacy of Cross-Lingual Paraphrases in a Machine Translation Setting
Following the growing trend in the semantics community towards models adapted to specific applications, the SemEval-2 Cross-Lingual Lexical Substitution and Word Sense Disambiguation tasks address the disambiguation needs of Machine Translation (MT). The experiments conducted in this study aim at assessing whether the proposed evaluation protocol and methodology provide a fair estimate of the a...
متن کامل